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ABSTRACT 


The  main  problem  considered  is:  Given  a  set  of  linear 
inequalities 

(1.1)  Ax  +  By  >  d  , 

wh.ch  defines  a  set  of  (x;y)  ,  find  and  concisely 
define  a  set  Y  of  y  such  that  if  (x;y)  solves 
(1.1)  then  y  belongs  to  Y  and,  conversely,  if  y 
belongs  to  Y  then  there  exists  an  x  such  that 
(x;y)  solves  (1.1). 

The  solution  to  this  problem  involves  finding  the  set 
of  all  extreme  rays  of  the  convex  cone 

wA  -  0  ,  w  >  0 

and  a  method  is  given  for  this.  The  method  is  compared 
with  other  methods  for  finding  extreme  rays  and  points 
and  finally  some  practical  applications  are  given. 


PROJECTIONS  OF  CONVEX  POLYHEDRAL  SETS 


by 

David  Kohler 


1.  INTRODUCTION 

Our  main  concern  will  be  with  the  problem: 

Given  a  set  of  linear  inequalities 

(1.1)  Ax  +  By  >  d  , 

which  defines  a  set  of  (x;y)  ,  find  and  concisely  define 
a  set  Y  of  y  such  that  if  (x;y)  solves  (1.1)  then 
y  belongs  to  Y  and,  conversely,  if  y  belongs  to  Y 
then  there  exists  an  x  such  that  (x;y)  solves  (1.1). 

(A  ,  B  and  d  are  assumed  to  be  real  arrays  with 
dimensions  A  ,  B  ,  d  and  m  ,  n  and  p  are 
assumed  to  be  finite.) 

If  we  find  the  set  Y  then  we  say  that  we  have  "eliminated"  x  from  the 
system  (1.1). 

The  title  of  this  thesis  reflects  the  fact  that  a  convex  polyhedral  set 
is  by  definition  a  set  describable  by  a  finite  system  of  linear  inequalities 
and  the  elimination  of  x  amounts  in  effect  to  the  projection  of  a  set  in 
(x;y)-space  onto  y-space.  Practical  problems  which  reduce  themselves  to  the 
above  problem  abound  in  operations  research  and  in  other  fields,  and  we  will 
discuss  some  of  them  in  Section  8. 

We  shall  develop  the  theory  by  proving  in  Section  3  that  the  set  Y 
can  be  defined  by  the  system  consisting  of  all  of  those  inequalities  in  y 
which  are  nonnegative  linear  combinations  of  the  inequalities  of  the  original 
system  (1.1).  That  is  to  say,  Y  may  be  defined  by  the  system  of  all 
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inequalities  of  the  form 

(1.2)  wAx  +  wBy  >  wd  ,  w  e  C 

where  C  *  (w  |  wA  ■  0  ,  w  >  0}  .  If  the  only  element  in  C  is  w  =  0  then 

y  is  clearly  unrestricted.  Otherwise  C  is  infinite — but  we  prove  in 
Section  A  that  there  is  a  finite  subset  G  of  C  such  that  Y  may  be 
defined  by  the  system  of  all  inequalities  of  the  type 

(1.3)  wAx  +  wBy  >  wd  ,  w  e  G  . 

In  Section  5  we  give  a  method  for  finding  G  (and  simultaneously  the  system 

(1.3) )  and  in  Section  6  we  give  some  computational  results.  It  will  become 
clear,  as  the  tneory  of  Sections  3,  A  and  5  is  developed,  that  the  sets  C 
and  G  are  dependent  only  on  A  and  that  (1.3)  is  the  most  concise 
definition  of  Y  which  we  can  give  if  we  have  no  prior  knowledge  of  B  and 
d  . 

But  before  we  can  develop  this  theory  we  must  describe  the  Fourier- 
Motzkin  Elimination  Method.  This  has  historical  interest  because  it  is  to  my 
knowledge  the  only  method  which  is  currently  available  and  we  must  explain 
why  it  is  not  often  used.  However,  the  main  reason  for  introducing  it  so 
early  is  because  it  is  most  useful  as  a  tool  for  motivating  and  proving  the 
theory  of  Sections  3  and  5. 
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2.  THE  F0UR1ER-M0TZKIN  ELIMINATION  METHOD 

Fourier  first  discovered  this  method  and  described  it  in  1824  in  the 
paper  "Solution  d'une  Question  Particuliere  du  Calcul  des  In£galit£s"  [7J. 
Later  T.  S.  Motzkin  drew  attention  to  the  method  in  his  Doctoral  Thesis 
"Beitrage  zur  Theorie  der  Linearen  Ungleichungen"  [11)  (this  is  in  German  but 
a  translation  was  made  by  D.  R.  Fulkerson  [11]).  A  more  accessible  account 
may  be  found  in  Dantzig  [5]. 

Essentially  the  method  eliminates  the  unwanted  variables  x^,X2»  ...  one 

by  one  until  linear  inequalities  in  y  alone  remain.  It  is  only  necessary  to 

describe  the  elimination  of  one  variable — since  the  elimination  of  the  other 

variables  is  just  a  repetition  of  this  procedure — and  we  will  select  x^  for 

this  purpose.  During  the  elimination  of  x^  the  variables  y^^,  •••»  yp 

do  not  play  a  role  which  is  conceptually  different  from  that  of  the  variables 

xotx_,  ...,  x  ,  It  complicates  the  expressions  to  include  them  explicitly 
Z  j  n 

and  so  we  will  ignore  the  y  variables  and  describe  the  elimination  of  x^ 
from  the  system 


'all 

al2 

•  •  •  •  3. 

In 

X1 

dl 

(2.1) 
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We  partition  the  set  of  row  indices  as  follows 


u 

1  ail 

>  0} 

u 

1  ail 

<  0) 

u 

1  ail 

**  0} 

and  examine  separately  the  two  cases 
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(A)  I+  or  I  or  both  are  empty  sets. 

(B)  Both  I*  and  I  are  nonempty. 

Case  A 

Here  the  elimination  of  x^  can  be  effected  very  easily,  thus: 

The  set  of  feasible  (x2,x,j,  ....  x^)  may  be  defined  by  the  system  of 
inequalities 

(2,2)  ai2x2  +  ai3X3  + . +  ainXn  -  di  ’  V  i  e  I 

because  if  (x^,x2,  ....  xr)  solves  (2.1)  then  it  solves  (2.2)  and, 
conversely,  if  (x.,x_,  ...,  x  )  solves  (2.2)  and  if  I+(I~)  is  nonempty 
then  we  can  always  choose  an  x^  so  positively  (negatively)  large  that  (2.1) 
is  satisfied  by  (x^x^x^,  ....  xr)  . 

We  draw  special  attention  to  the  fact  that  the  set  of  feasible 
(x^x^ . xfl)  is  unrestricted  if  1°  is  empty. 

Case  B 

In  this  case  we  proceed  first  by  reformulating  the  statement  of  the 
inequalities  (2.1).  If 

*ilxl  +  a12x2  + . +  alnxn  i  di 

is  an  inequality  of  (2.1)  and  if  i  c  I+  then  we  reformulate  it  as 

-ai2x2  'ai3x3  ~ainxn  .  dl 
x,  > - . . .  -  + -  , 

1  "  ail  ail  ail  aii 

and  let  1^  represent  the  expression  on  the  R.H.S.;  alternatively  if  i  t  I 


we  reformulate  it  as 
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"ai2X2  “ai3X3 


ail  ail 


~amxn  + 

ail  ail  "  1 


and  let  represent  the  expression  on  the  L.H.S. 

Thn«s  (?.}  >  mqv  Vip  rp'st’afpd  ric 


(2.3) 


X1  ^  Xi 


Ui  £  X1 


ai2x2  +  ai3X3  +  +  ainXn  -  dl  * 


V  i  e  I+ 

V  i  E  r 

V  i  e  1° 


It  is  now  cxear  that  we  can  describe  the  set  of  feasible  (x2,x^,  •••,  xfl) 
after  the  elimination  of  x^  ,  by  the  following  inequalities 


(2.4) 


ui  > 

k  -  i 


,  V  i  e  I+  and  k  e  I 


'aL2x2  +  ai3X3  +  +  ainXn  >  ^  ,  V  i  e  I 


That  is,  any  set  of  (Xj.x^,  ....  xr)  which  s?cisfies  (2.1)  also  satisfies 

(2.4).  Conversely,  if  •••»  x  )  solves  (2.4)  then  we  can  always  choose 

an  x,  so  that  (x, ,x„,  ...,  x  )  solves  (2.1).  The  reason  for  the  latter  is 
1  Li  n 

that  the  inequalities  (2.4)  ensure  that 


Max  (1.)  <  Min  (u,) 

+  1  -  x 
iel  kel 


and  so  any  x^  in  the  interval 


Max  (1.)  ,  Min  (u.) 

+  1  -  x 

Liel  kel  J 


Min  (u^)|  will  satisfy 


X1  -  Uk  ’ 


V  k  E  I' 


and 


li ;  xi 


V  iel  . 
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This  completes  the  description  of  the  elimination  of  x^  from  (2.1). 

The  elimination  of  x  from  the  system  (1.1)  simply  involves  repeating  this 
process  for  x^x^,  . . . ,  xr  in  turn.  (Of  course  no  significance  is  attached 
to  the  sequence  of  elimination  of  the  unwanted  variables.  Any  ether  sequence 
of  elimination  would  serve  the  purpose  equally  well — but  note  that  different 
sequences  may  yield  different  sets  of  inequalities  defining  Y  .)  One  last 
point  is  worth  emphasizing.  It  is  this:  if  nr  inequalities  are  generated  by 
the  elimination  of  one  of  the  unwanted  variables  then  this  immediately  implies 
that  the  set  Y  is  unrestricted  and  that  we  need  not,  indeed  cannot,  perforin 
any  subsequent  eli  _.iations. 

An  example  will  illustrate  the  general  method.  We  wish  to  eliminate  x^ 
and  x2  from  the  system 

-  *1  +  x2  +  n  ;  1 

-  x2  +  2yx  »  -1 

-2xx  +  4x2  +  3y1  >  1 

-  3x2  -  4yx  >  -1 

(2.5)  -  2x2  -  5y1  >  1 

5x2  -  6yi  >  -1 

4x2  "  7?i  -  "2 

8y  1  >  4 
“  9*i  S  -2 

The  first  column  is  all  nonpositive.  So  the  elimination  of  x^  gives: 


-  x2  +  2*l  i  _1 

-  3x2  -  Ayx  >  -1 

-  2x2  -  5yx  >  1 

(2.6)  5x„  -  6y  >  -i 

2  1  ■ 

4x2  -  7yx  >  -2 

8yx  >  4 

-  ^  i  -2 

We  reformulate  this  as 


(2.7) 


4  ^1 

3  yl  3  “ 


X2  - 


X2  2 


2  yx  +  i  >  x2 
x„ 


5  1  s 

2  yl  "  2  2 


6 

5 

7 


5  yl 


A  '1 


y,  - 


8y  i  >  4 


-  9yx  >  -2 


and  the  elimination  of  x2  gives 


(2.8) 


2  y,  +  1 
1 


"  3  yl  +  3 


A 

3 

5  1 

2  yl  ”  2 

2  yx  +  1 

4  4.  1 

3  yl  + 

3 

2  yl  " 


3 

1 

2 

8y, 


5  yl  “ 


5  yl  - 


5  yl  " 


6 

5 

6 

5 

6 
5 
7 
A 
7 

A  yl  - 
7 

A  yl  - 


=  A  *1 


-  9yx  >  -2 


|ro  -f> jto  ui|h-*  ui|h  v-n | ►—* 
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Or,  putting  the  variables  on  the  L.H.S.  and  the  constant  terms  on  the  R.H.S. 
we  have 


(2.9) 


4 

5 

yl 

> 

m 

6 

“  5 

38 

v 

> 

8 

15 

yl 

“  15 

37 

10 

yl 

> 

m 

3 

10 

1 

4 

yl 

> 

3 

"  2 

37 

> 

5 

12 

yl 

n 

”  6 

17 

4 

yl 

> 

0 

8 

yl 

> 

c 

4 

-  9 

yl 

> 

-  2 

The  main  problem  which  occurs  when  using  this  method  is  that  Case  B  seems 
to  occur  more  frequently  than  Case  A  in  practical  problems.  In  Case  B  if  m+ 
and  m  are  the  numbers  of  Indices  in  I+  and  I  respectively  then  the 
number  of  inequalities  after  elimination  is 

+  —  ,  +  — V 

m  •  m  +  (m  -  m  -  m) 


or  an  increase  of 

(m+  -  1)  (m  -  1)  -  1  . 

Thus  the  number  of  inequalities  generated  by  the  elimination  of  a  variable 
can  be,  and  usually  is,  much  larger  than  the  number  before  elimination.  After 
a  few  eliminations  the  number  usually  becomes  impossibly  large,  even  when  m 
is  quite  small.  For  example,  if  A  is  the  matrix 


then  the  elimination  of 


“l  ]  6 

1-4  4 

1  -11  10 
-1  7-6 

-1  3-7 

-1-8  5 

.-1  -  9  1 


generates  12  inequalities, 
the  elimination  of  X2  generates  27  inequalities  and 
the  elimination  of  x^  generates  126  inequalities. 

However,  the  Fourier-Motzkin  Method  has  other  uses.  In  particular  we 
can  make  the  following  deductions  from  it: 

(1)  The  Fourier-Motzkin  Method  demonstrates  something  which  many  people 
would  regard  as  intuitively  reasonable — namely  that  the  set  Y  of 
feasible  y  can  always  be  defined  by  a  finite  set  of  linear 
inequalities  in  y  alone;  or,  in  other  words,  the  projection  of  a 
convex  polyhedral  set  onto  a  vector  subspace  is  itself  also  a 
convex  polyhedral  set. 

(2)  The  inequalities,  if  any,  which  are  generated  by  an  elimination  are 
nonnegative  linear  combinations  of  the  original  inequalities.  In 
Case  A,  of  course,  this  is  obvious  since  the  inequalities  after 
elimination  are  some  of  the  original  ones.  In  Case  B  it  can  be 
demonstrated  easily  by  comparing  (2.3)  with  (2.4).  Reformulating 
(2.3)  as 


(2.10) 


*1 ;  h  • 

-x,  >  -u 


ai2X2  +  ai3X3  + 


1  -  ”i  ’ 

+  a.  x  >  d.  , 
in  n  ■  i 


V  i  e  I+ 

V  i  e  I' 

V  i  e  1° 
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we  can  see  that  each  inequality  in  (2.4)  of  the  form 


Ai 


is  the  sum  of  two  inequalities  in  (2.10)  and,  since  the  inequalities  of  (2.10) 
are  positive  scalar  multiples  of  some  of  those  of  (2.1),  we  have  proved  our 
assertion. 

This  means  that  the  elimination  of  x^  ,  if  it  does  generate 
inequalities,  is  equivalent  to  premultiplying  by  a  matrix  the  system 

which  we  had  obtained  after  eliminating  x^  ^  .  The  system  we  obtain  after 
eliminating  x^,X2«  . . .  ,  x^  is  therefore 

MJ  •  M^_1  •  ....  •  M2  •  M1  •  Ax  +  Mj  •  ...  •  M2  •  M1  •  By 
>  MJ  •  ...  •  M2  •  M1  •  d 


The  matrices  have  a  special  form  and  may  be  constructed  easily  and 

in  an  obvious  manner  from  the  column  of  coefficients  of  x^  ,  the  variable 
about  to  be  eliminated.  Let 


3lj 

a2j 

a3j 


be  the  column  of  coefficients  of  x^  before  elimination.  A 

is  constructed  from  each  pair  of  elements  a^'j  and  ai''j  * 

is  negative  and  the  other  positive.  The  row  (m  ,  ,m  „,m  . 

rl  r2  r3 


row  r  of 
one  of  which 
..)  has  the 


form 
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IVjl-1  if 

I ai • »  j I  if 


k  -  i’ 


k  -  1' 


otherwise 


there  Is  also  a  row  s  of  MJ  corresponding  to  each  element  a^  which  is 
zero.  In  this  case 


it  -  i 


otherwise 


For  example  if  the  coefficients  of  before  elimination  are 


i 

A 
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We  are  now  in  a  position  to  state  the  following  definition  and  prove 
the  subsequent  lemma: 

Definition  of  F 

Apply  the  Fourier-Motzkin  Method  to  the  elimination  of  x  from  (1.1) 
and  if  the  method  does  generate  inequalities  then  define  F  to  be  the  set 
of  all  rows  of  the  matrix 

M11  •  M11'1  •  _  *  M2  •  M1 

Alternatively  if  no  Inequalities  are  generated  define  F  to  be  the  vector 

w  =  0  . 

(N.B.  F  is  dependent  not  only  on  A  but  also  on  the  order  of  the  variables 
eliminated.  It  is  assumed  in  defining  F  that  this  sequence  is  known.) 

Lemma  2 . 1 

Y  may  be  defined  by  the  system 
(2.11)  wAx  +  wBy  >  wd  ,  V  w  e  f 

Proof : 

The  Fourier-Motzkin  Method  generates  the  system  (2.11)  if  it  generates 
any  inequalities  at  all.  On  the  other  hand  Y  is  unrestricted  and  may  be 
defined  by  the  inequality 

0  •  Ax  +  0  •  By  >  0  •  d 


Q.E.D. 
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3.  THE  KEY  THEOREM 

Consider  the  system  of  inequalities  (1.1). 

Let  S  be  the  set  of  all  nonnegative  linear  combinations  of  the 
inequalities  of  (1.1)  which  annihilate  A  ,  i.e.,  the  set  of  inequalities 

wAx  +  wBy  >  wd  ,  V  w  e  C 

where  C  =  (w  |  wA  =  0  ,  w  >  0)  . 

Also,  let  T  be  the  set  of  inequalities 

wAx  +  wBy  >  wd  ,  V  w  t  F  . 

As  we  showed  ir  the  previous  section  F  is  a  subset  of  C  and  therefore  T 
is  a  subset  of  S  . 

Now  let  U  be  the  set  of  y  which  satisfies  (1.1),  i.e.,  the  set  of 
y  for  which  there  exists  an  x  such  that  (x;y)  satisfies  (1.1);  let  V  be 
the  set  of  y  which  satisfies  S  and  finally  let  W  be  the  set  of  y  which 
satisfies  T  . 

Then  the  following  statements  are  true  whatever  B  and  d  may  be: 

Any  solution  of  a  system  of  linear  inequalities  also  solves  any  non¬ 
negative  linear  combination  of  these  inequalities 

. ' .  U  C  V  . 


Also  S  D  T 


.  * .  V  C  W  . 

And  the  Four ier-Motzkin  Method  implies  that  W  =  U  .  This  proves  the 

Theorem  3. 1 


Whatever  B  and  d  may  be,  U  =  V  =  W  . 
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The  importance  of  this  theorem  should  be  clear  after  reading  the  next 
sections  but  we  may  hint  at  it  now.  Although  the  set  S  may  be  infinite,  we 
only  require  a  relatively  small  number  of  these  inequalities  in  order  to 
adequately  define  U  .  These  inequalities  are  necessarily  a  subset  of  T 
the  inequalities  generated  by  the  Fourier-Motzkin  Method. 
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4.  THE  CONVEX  CONE:  wA  -  0  ,  w  >  0 


Let  C  denote  the  set  of  all  solutions  of 


(4.1) 


wA  ■  0  ,  w  >  0  . 

m 


It  is  Important  to  see  that  if  a  particular  element 

12  3 

linear  combination  of  other  elements  w  ,w  ,w  ,  ... 
corresponding  inequality 


w°  of  C  is  a  nonnegative 
of  C  then  the 


w°By 


o  . 
>  w  d 


is  a  redundant  inequality  in  the  set  S  .  This  is  because  if 


o  ,11^,22^,33, 
w  =  A  w  +  A  w  +  Aw  +  . . . . 


12  3 

for  some  nonnegative  scalars  A  ,A  ,A  ,  ...  then 


w°B  =  A1(w1B)  +  A2(w2B)  +  ... 


and 


w°d  *  A^(w^d)  +  A2(w2d)  +  ... 


Thus  it  is  natural  to  search  for  the  smallest  subset  G  of  C  which 
possesses  the  property  that  every  element  of  C  is  a  nonnegative  linear 
combination  of  the  elements  of  G  .  Theorem  4.2  below  provides  the  information 
we  need  in  order  to  describe  G  but  before  we  can  state  and  prove  it  we  need 
the  following  notation,  definitions  and  lemma: 

(i)  Let  represent  the  row  of  A  .  (A  has  a  finite 

number  of  rows.) 

(ii)  If  w°  is  a  given  m-vector  then  we  define  the  subset  I(w°) 
of  the  indices  {1,2,3,  ...,  m)  as 
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I(w°)  -  {i  |  wj 

i  0}  . 

(lii) 

If 

o 

w 

is  a  vector  in  C 

we  define  the  subset 

K(w°)  of  rows 

of 

A 

as  follows 

K(w°)  -  {f.±  |  i  e 

I(w°)}  . 

(lv) 

We 

will 

use  the  notation 

(w°)  to  represent  the 

set 

{Aw°  |  A  >  0}  . 

(v)  A  let  of  s  rows  of  A  is  said  to  possess  r  dependent  rows 
if  it 

(a)  contains  a  set  of  (s-r)  independent  rows,  and 

(b)  does  not  contain  a  set  of  (s-r+1)  independent  rows. 

(Note  that  the  row  of  all  zeros  alone  constitutes  a  dependent 
set. ) 

(vi)  If  w°  is  a  nonzero  element  in  C  and  if  K(w°)  contains 
exactly  one  dependent  row  then  w°  will  be  called  an  extreme 
vector  of  C  and  the  set  (w°)  will  be  called  an  extreme  half¬ 
line  of  C  . 

(vii)  A  sufficient  set  of  extreme  vectors  of  C  is  a  subset  of  C 
consisting  of  exactly  one  extreme  vector  from  every  one  of  the 
extreme  half-lines  of  C  . 

(viii)  Lemma  4 . 1 

Let  w^  be  a  nonzero  solution  of  wA  **  0  and  let 

X  2 

|  i  e  I(w  )}  contain  exactly  one  dependent  row.  If  w 

is  another  solution  of  wA  ■  0  and  if 

U1  |  i  c  I(w2)}  C  {q  |  i  e  Kw1)} 

2  1 
then  w  is  a  scalar  multiple  of  w 
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Proof : 


Consider  the  system 


(4.2) 


l 

iel(w^) 


Vi  - 0 


where  |  i  e  I(w  ))  contains  one  dependent  row.  If  i° 

is  some  i  in  I(w^)  and  if  we  are  given  the  value  of  w  q 

in  a  solution  of  (4.2)  then  the  values  of  the  other  variables 

required  to  solve  (4.2)  are  precisely  determined. 

2  1 

Therefore  if  w^o  ■  w^  -  kw^o  in  a  solution  of  (4.2),  the 
other  variables  must  be 


w. 


w , 


kw. 


V  i  e  I(w  )  . 


Q.E.D. 


Theorem  4. 2 

If  C  contains  a  nonzero  element  and  if  G  is  a  sufficient  set  of 
extreme  vectors  of  C  then  G  is  finite  and  every  element  of  C  is  a  non¬ 
negative  linear  combination  of  the  elements  of  G  .  Furthermore  if  G'  is 
any  other  subset  of  C  which  has  the  property  that  every  element  of  C  is  a 
nonnegative  linear  combination  of  the  elements  of  G'  ,  then  each  element  of 
G  is  a  positive  scalar  multiple  of  some  element  of  G'  and  therefore  no 
smaller  set  than  G  has  this  property. 

Proof : 

Let  w°  be  any  nonzero  element  of  C  .  K(w°)  must  contain  at  least  one 
redundant  row  since  w°  would  necessarily  be  zero  if  K(w°)  were  an 
independent  set.  If  K(w°)  contains  exactly  one  dependent  row  then  w°  is 
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itself  a  scalar  multiple  of  an  element  of  G  by  Lemma  4.1.  Alternatively, 
if  K(v°)  contains  more  than  one  dependent  row,  we  select  any  row  in  K(w°)  , 
say  ,  and  define  the  subset  I(w°,l)  of  the  indices  {1,2,3,  ....  m)  as 

I(w°,l)  -  U  |  w°  >  0  ,  i  +  1} 
and  then  form  the  following  system  of  linear  equations: 


(4.3) 


y  v  r  ■  ~w°f 

L  i’i  l’l 


iLl(wU,l) 


(Notice  that  {v^  |  i  c  I(w°,l)}  is  a  set  of  variables  but  w°  is  a 
constant . ) 

Demonstrably  (4.3)  has  a  nonnegative  solution  and  the  Simplex  Method  of 
Linear  Programming  can  be  used  to  show  that  if  there  is  a  nonnegative 
solution  of  a  system  of  linear  equations  then  there  is  at  least  one  non¬ 
negative  basics  solution.  Let  {v^  |  i  e  I(w°,l)}  be  such  a  nonnegative  basic 
solution  of  (4.3)  and  define  w  as 


Vf  if 


w°  if 


i  e  I (w  ,1) 


i  =  1 


0  otherwise 


We  next  find  the  largest  (positive)  scalar  k  such  that  (w  -kw)  is  non¬ 
negative  and  we  caa  then  form  w°  as  the  sum 


w°  =  (w°  -  kw)  +  kw 


where 


(i)  w  is  a  scalar  multiple  of  an  element  of  G  ,  and 

(ii)  K(w°  -  kw)  is  a  strict  subset  of  K(w°)  . 
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We  repeat  the  above  process,  using  (w°-kw)  where  above  we  used  w°  , 

until  we  have  expressed  w°  as  a  nonnegative  linear  combination  of  elements 

of  G  .  The  phrase  "if  C  contains  a  nonzero  element"  is  inserted  in  the 

statement  of  the  theorem  in  order  to  ensure  that  G  be  nonempty  and  when 

this  is  the  case  it  is  trivially  obvious  that  the  zero  vector  is  a  nonnegative 

linear  combination  of  the  elements  of  G  .  So  we  have  proved  that  any 

element  of  C  is  a  nonnegative  combination  of  the  elemencs  of  G  . 

G  must  be  finite  because  there  are  only  finitely  many  combinations  of 

rows  of  A  which  have  one  dependent  row. 

Only  the  last  assertion  remains  to  be  proved  and  this  depends  on  the 

fact  that  if  an  element,  say  w^  ,  of  G  is  a  positive  linear  combination  of 

-1  -2  -3 

some  other  elements  w  ,w  ,w  ,  . .  .  of  C  then  these  elements  are  nonnegative 

scalar  multiples  of  w^  .  We  show  this  as  follows: 

Let 


1 

w 


.1-1 

Aw  + 


.2-2  j 

A  w  + 


,  3-3  , 
A  w  + 


-1-2-3  123  1-1  -2  -3 

where  w  ,w  ,w  ,  ...,  c  C  and  A  ,A  ,A  ,  >  0  .  Since  w  ,w  ,w  ,w  ,  .. 

12  3  1 

are  all  nonnegative,  K(w  ),K(w  ),K(w  ),  ...  must  be  subsets  of  K(w  ) 

because  w^  >  0  =>  w^  >  0  .  But  K(w^)  contains  exactly  one  dependent  row 

-1  -2  -3  1 

and  so  by  Lemma  4.1  w  ,w  ,w  ,  ...  must  be  scalar  multiples  of  w  and  any 

set  G'  which  possesses  the  property  described  in  the  statement  of  the 
theorem,  must  therefore  contain  at  least  one  element  from  each  extreme  half¬ 
line.  Since  G  possesses  only  one  element  from  each  of  these  extreme  half¬ 
lines  there  can  be  no  subset  of  C  smaller  than  G  with  this  property. 


Q.E.D. 
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5.  A  CONCISE  DEFINITION  OF  Y 

We  now  have  a  solution.  In  theory  at  least,  to  the  problem  of  finding 
a  concise  definition  of  the  set  Y  which  we  stated  in  Section  1.  We  may 
express  the  solution  as  follows: 

Given  the  system 


(5.1)  Ax  +  By  >  d 
we  examine  the  associated  system 

(5.2)  wA  =  0  ,  w  >  0  . 

If  the  only  solution  of  (5.2)  is  w  5  0  then  the  set  Y  is  unrestricted, 
i.e.,  for  any  y  whatsoever  we  can  always  find  an  x  such  that  (5.1)  is 
satisfied.  Alternatively  if  (5.2)  has  more  than  one  solution  then  the  system 

(5.3)  wAx  +  wBy  =  wBy  >  wd  ,  V  w  e  G 

where  G  is  a  sufficient  set  of  extreme  vectors  of  C  ,  is  a  concise 
definition  of  Y  .  In  fact  if  we  have  no  prior  knowledge  of  B  and  d  then 
there  can  be  no  more  concise  definition  of  Y  than  (5.3) — as  we  can  show  by 

letting  B  =  I  and  d  =  0  .  Then  (5.3)  reduces  to 

wy  >  0  ,  V  w  c  G 

which  clearly  does  not  contain  a  redundant  inequality  because,  by  Farkas' 
Lemma  [6],  a  member  of  i  system  of  homogeneous  linear  inequalities  is 
redundant  if  and  only  if  it  is  a  nonnegative  linear  combination  of  the  other 

inequalities  of  the  system  and  we  have  deliberately  chosen  G  so  that  this 

is  not  the  case. 


In  this  section  we  will  describe  an  adaptation  of  the  Fourier-Mot .kin 
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Method  which  determines  whether  C  has  more  than  one  solution  and,  if  so, 
produces  a  definition  of  Y  in  the  form  (5.3).  It  makes  use  of  the  following 
theorems : 

Theorem  5 . 1 

Let  C  contain  more  than  one  solution  and  let  H  be  a  subset  of  C 
such  that  if  (x;y)  solves  (1.1)  then  it  solves  the  system 

(5.4)  wAx  +  wBy  -  wBy  >  wd  ,  V  w  t  H 

and  if  y  solves  (5.4)  then  there  exists  an  x  such  that  (x;y)  solves 
(1.1),  whatever  B  and  d  may  be. 

Then  H  contains  a  sufficient  set  of  extreme  vectors  of  the  set  C  . 

Proof : 

The  solution  sets  of  the  system  (5.4)  and  of  the  system 

(5.5)  wAx  +  wBy  =  wBy  >  wd  ,  V  w  e  G 

are  identical,  whatever  B  and  d  may  be.  So  we  may  legitimately  allow 

B  and  d  to  take  on  the  special  values  I  and  0  respectively  in  order 
to  examine  the  relationship  between  G  and  H  .  Then  (5.4)  and  (5.5)  reduce 
to 

(5.6)  wy  >  0  ,  V  w  t  H 

(5.7)  wy  >  0  ,  VweG 

respectively.  Each  inequality  of  (5.7)  is  satisfied  by  every  solution  of  the 
system  (5.6)  and  therefore,  by  Farkas'  Lemma,  each  element  of  G  is  a  non¬ 
negative  linear  combination  of  the  elements  of  H  . 

The  required  result  then  follows  as  a  direct  consequence  of  Theorem  4.2. 


Q.E.D. 
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Theorem  5.1  tells  us,  for  example,  that  we  may  restrict  our  search  for  a 
sufficient  set  of  extreme  vectors  to  the  set  F  which  we  defined  at  the  end 
of  Section  2.  The  problem  is  to  distinguish  them  from  the  other  vectors  in 
F  .  Theorems  5.2  and  5.3  below  describe  the  tools  with  which  we  may 
conveniently  do  this. 

Theorem  5.2 

w°  is  an  extreme  vector  of  C  if  and  only  if 

(i)  w°  is  a  nonzero  element  of  C  ,  and 

(ii)  there  does  not  exist  a  nonzero  element  w1  of  C  such  that 
Ifw1)  is  a  strict  subset  of  I(w°)  . 

Proof: 

Assume  that  w°  is  an  extreme  vector  and  that  w1  is  an  element  of  C 

such  that  Ifw1)  is  a  strict  subset  of  I(w°)  .  Then  {E,^  |  i  e  I(w°)} 

contains  one  dependent  row  and  by  Lemma  4.1  w1  is  a  scalar  multiple  of  w°  , 

say  w1  «  kw°  .  But  I(w')  is  a  strict  subset  of  I(w°)  and  therefore 

there  exists  an  i  ,  say  i'  ,  belonging  to  I(w°)  but  not  to  Ifw1)  .  This 

implies  that  w1,  =  kw°,  =  0  which  in  turn  implies  that  w1  must  be  zero. 

Therefore  if  w°  is  an  extreme  vector  there  cannot  be  a  nonzero  element 

of  C  such  that  Ifw1)  is  a  strict  subset  of  I(w°)  . 

Arguing  in  the  opposite  direction,  assume  that  w°  is  a  nonzero 

element  of  C  and  that  there  does  not  exist  a  nonzero  w1  t  C  such  that 

Ifw1)  is  a  strict  subset  of  I(w°)  .  By  Theorem  4.2  every  nonzero  element 

of  C  is  a  positive  linear  combination  of  one  or  more  members  of  some  given 

12  3 

sufficient  set  of  extreme  vectors  {w  ,w  ,w  ,  ...} 

o  ,1-1  .  2-2  .  3-3  ,  1  2  ,3 

w  *  A  w  +  A  w  +  A  w  + .  A, A, A,...  >0 
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-1  -2-3  12 

where  w  ,w  ,w  ,  ...  are  elements  of  {w  ,w  ,  . . . }  .  But  our  assumption, 

o  X  2 

together  with  the  fact  that  w  ,w  ,w  ,  ...  are  nonnegative,  implies  that 
I(w°)  Kw1)  =  l(w2)  =  ...  which  in  turn  implies,  by  Lemma  4.1,  that 
w°,w  ,w2,  ...  are  scalar  multiples  of  one  another.  Therefore  w°  is  an 
extreme  vector  (and  is  in  fact  a  positive  scalar  multiple  of  exactly  one 
element  of  a  sufficient  set  of  extreme  vectors). 

Q.E.D. 

This  theorem  is  useful  in  the  following  way:  We  have  established  that 
F  contains  a  sufficient  set  of  extreme  vectors.  So  we  simply  search  through 
F  discarding  any  vector  w^  if  there  exists  another  vector  w°  in  F 
such  that  l(w°)  is  a  strict  subset  of  I(w^)  .  I  have  been  unable  to  prove 
that  we  can  discount  the  possibility  of  the  Fourier-Motzkin  Method  generating 
more  than  one  extreme  vector  from  the  same  extreme  half-line.  Should  this 
occur  we  need  only  keep  one  of  them. 

Theorem  5. 3 

If  A  contains  n  columns  then  no  extreme  vector  can  contain  more  than 
n+1  nonzero  components. 

Proof : 

If  w°  is  an  extreme  vector  then  { f, ^  |  i  c  I(w°)}  contains  one 
dependent  row  and  a  set  of  no  more  than  n  independent  rows. 

Q.E.D. 

This  means  that  we  may  discard  those  vectors  in  F  which  have  more  than 
n+1  nonzero  components.  It  is  a  simpler  and  quicker  method  than  the  one 
derived  from  Theorem  5.2  but,  unlike  the  latter,  it  may  not  discard  all  of 
the  unwanted  vectors  in  F  . 

It  is  important  to  realize  that  we  do  not  have  to  wait  until  we  have 
eliminated  all  n  variables  x^,X2,  ...,  by  the  Fourier-Motzkin  Method 
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before  we  can  apply  these  methods.  They  may  be  applied  at  any  time  and  as 

often  as  we  like  during  the  elimination  of  the  variables  because,  although 

the  theorems  were  stated  with  regard  to  the  complete  matrix,  they  are  of 

(k) 

course  equally  true  when  applied  to  a  matrix  A  containing  any  k 
columns  of  A  ,  for  k  *  1,2,3,  n  (or  any  other  matrix  for  that 

matter) . 

Now  we  can  describe  the  method  for  finding  a  sufficient  set  of  extreme 
vectors.  It  goes  as  follows: 

Beginning  with  the  system 

(5.8)  Ax  +  By  >  d 

we  eliminate  x^  using  the  Fourier-Motzkin  Method  and  if  this  generates  any 
inequalities  at  all  it  is  equivalent  to  premultiplying  (5.8)  by  a  matrix 
,  as  we  showed  in  Section  2.  (If  the  Fourier-Motzkin  Method  generates  no 
inequalities  at  this  or  a  later  stage  we  deduce  at  once  that  Y  is 
unrestricted.)  We  examine  the  rows  of  M^  to  see  if  there  are  any  which  do 
not  belong  to  a  sufficient  set  of  extreme  vectors  of  the  convex  cone 


(w1iW2, 


w  ) 
m 


w.  >  0  , 
i  = 


i  =  1.2, 


m 


using  the  criteria  of  Theorems  5.2  and  5.3.  Actually  each  row  of  M  is  a 
member  of  a  sufficient  set  of  extreme  vectors,  whatever  A  may  be,  because 
a  row  of  M^  contains  either  one  nonzero  element  which  is  the  only  nonzero 
element  in  its  column  or  a  unique  combination  of  two  nonzero  elements,  and 


« 


therefore  each  row  of  M  is  an  extreme  vector  by  Theorem  5.2.  We  define 
to  be  and  proceed  to  the  elimination  in  turn  of  the  other  variables. 

The  procedure  for  X2»x^f  ...  etc.  is  the  same  for  each  and  we  will  describe 
it  for  the  general  case,  : 


-1-1  -1-2  -2  -1 
(MJ  •  MJ  •  _  •  M  •  M  )  •  Ax  + 


(5.9) 


rSJ-1  . 


2  r.  1 


_  '  M  •  M  )  •  By  >  (M‘ 


>  (M^_1 


...  •  M1)  •  d 


is  the  system  we  have  obtained  by  eliminating  x^jXj,  ...»  x^  ^  .  Actually 

it  is  more  convenient  not  to  carry  this  system  explicitly.  We  only  need  the 
-1-1  -2  -1 

matrix  (M  *....•  M  •  M  )  and  the  original  A  ,  B  and  d  and  we  can 
obtain  updated  columns  as  we  require  them.  The  column  of  (5.9), 


( 1  • 


-2  -1 
M  •  M  )  a 


is  computed  and  the  Fourier-Motzkin  Method  is  applied  to  eliminate  Xj  from 
(5.9).  If  this  generates  any  inequalities  it  is  equivalent  to  premultiplying 


(5.9)  by  a  matrix  MJ  giving 


(5.10) 


MJ(Mj  1  •  -  •  M1)Ax  +  •  _  •  M1)  By 


1  • 


-2  -1 
•  M  '  M  )d 


By  Theorem  5.1  the  rows  of  the  matrix 


Mj(Mj“1  •  MJ"2  •  _  •  M2  *  M1) 


contain  a  sufficient  set  of  extreme  vectors  of  the  convex  cone 
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(w  . . w  ) 

l  i  m 


all  a12  alj 


a21  a22  * 


2j 


3  i  3  n  ••••  a  - 

,  ml  m?  mjj 


Wi  >  0  , 


-  0 


1  m  1 1 2 1  •  •  •  |  m 


and  therefore  we  can  apply  the  methods  of  Theorem  5.2  or  Theorem  5.3  or  both 


in  order  to  detect  them.  If  a  row  of  (M^  ^ 


M2  •  M^)  is  detected 


to  be  not  a  member  of  a  uui f icient  set  of  extreme  vectors  then  the  corres¬ 
ponding  row  of  is  deleted.  The  result,  after  all  deletions  have  been 

made  to  ,  is  defined  to  be  the  matrix  .  This  completes  the 

elimination  of  x^  and  we  are  ready  to  repeat  the  process  on  the  variable 


Xj+1  ‘ 

A 

generated  inequalities  at  every  elimination),  the  rows  of  the  matrix 


At  the  end,  when  x  has  been  eliminated  (assuming  that  the  method  has 

n 


M*1  •  M* 


rJi-1 


M2  •  M1 


constitute  a  sufficient  set  of  extreme  vectors  of  the  set  C  . 

Some  remarks,  mostly  concerning  computation,  are  relevant  at  this  point. 

(1)  Computational  experience  seems  to  indicate  that  it  is  preferable 
to  apply  the  method  of  Theorem  5.3  after  the  elimination  of  each 
variable  in  order  to  delete  rows  of  and  to  use  the  method  of 

Theorem  5.2  periodically,  say  after  every  10  eliminations  or  when 
the  number  of  rows  remaining  after  applying  the  former  method  is 
still  large.  This  is  recommended  because  the  criterion  of 
Theorem  5.3  is  so  easy  to  apply  and  seems  to  eliminate  most  of  the 
unwanted  rows.  However  the  work  involved  in  the  method  of 
Theorem  5.2  is  also  quite  small — especially  when  programmed  for  a 

| 
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computer — because  binary  bits  may  be  used  to  indicate  whether  a 
component  of  a  vector  is  positive  or  not  and  most  scientific 
computers  have  logical  com  •>nds,  such  as  AND,  OR,  EXCLUSIVE  OR, 
which  would  facilitate  the  comparisons  of  the  I(w)'a  . 

(2)  It  is  not  necessary  to  calculate  the  actual  values  of  the  elements 
of  the  matrix 


mV1 


m^2 


m2 


M1) 


before  rows  are  deleted  from  .  The  criteria  of  Theorems  5.2 
and  5.3  both  rely  only  on  whether  elements  are  zero  or  not  zero. 
The  values  of  the  elements  should  only  be  calculated  for  . 

In  fact,  as  the  rows  of  are  created,  a  test  should  be  made  to 

see  whether  they  can  be  deleted  immediately.  If  so  then  it  is 
unnecessary  to  keep  them  of  course. 


Example 

Eliminate  x  from  the  system 
(5 . 11)  Ax  +  By  >  d 

where 

"0  11  7' 

2  2  0  -3 

.  1-10-1 
A  = 

-12  0  3 

9 

-3  0  0  t 

-1-20  1 


''it  is  not  necessary  for  this  purpose  to  know  what  B  and  d  are). 
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(i)  Elimination  of 

Column  1  of  A  is 


and  so 


(ii)  Elimination  of 

Premultiplying  the  second  column  of  A  by  we  get 


and  so 
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The  strictly  positive  elements  of  are  located  as 

follows : 


+  + 
+ 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

+ 


+ 

+ 

+ 

+  + 

+  + 

+ 

+ 

+  + 

+  + 

+  + 


-  DELETE 
<-  DELETE 


<-  DELETE 
♦-  DELETE 
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By  Theorem  5.3  we  can  delete  rows  5,  6,  9  and  10  of  m  because 
they  contain  four  positive  elements.  The  remaining  rows  form  a 
sufficient  set  of  extreme  vectors  as  can  be  seen  by  applying 
Theorem  5.2 


(iii)  Elimination  of 

-2  -1 

Premultiplying  the  third  column  of  A  by  M  •  M  we  get 


31 


This  is  all  nonnegative  and  so  we  can  delete  the  first  three 
inequalities  of  the  system 

-2-1  -2-1  -2-1 
M  M  Ax  +  M  M  By  >  M  M  d 


(iv)  Elimination  of 

-3  -2  -1 

Premultiplying  the  fourth  column  by  M  •  M  •  M  we  get 
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the  positive  elements  of  M^(M3H^M1)  are 


0+0 
+  +  + 


+  +  0 

+  +  + 


+  + 


DELETE 


DELETE 

DELETE 


By  Theorem  5.2  we  can  delete  rows  2,  4  and  5 


•.  ma  = 


1  0  0  0  0 


0  0  1  y  o 


mam3m2m1 


h  0 


1 
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t 


i 


and  the  set  Y  may  be  defined  by  the  two  inequalities 
(M4M3M2M1)By  >  (mVm^H 

The  unmodified  Four ier-Motzkin  would  have  generated  16 
inequalities  had  it  been  applied  to  this  problem. 


% 


I 
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6.  COMPUTATIONAL  RESULTS 

Some  tests  have  been  made  to  compare  the  method  with  the  Fourier-Motzkin 
Method  and  the  test  problems  are  given  below.  Following  a  short  description 
of  each  problem  there  are  two  rows  of  figures,  e.g., 

12  25  154 
12  10  8 

The  first  of  these  rows  gives  the  number  of  inequalities  generated  by  the 
Fourier-Motzkin  Method  after  each  elimination,  i.e.,  in  this  case  12 
Inequalities  are  generated  when  x^  is  eliminated,  25  when  x^  is  eliminated, 
etc.  The  second  row  gives  the  number  of  inequalities  generated  by  the  method 

i k 

described  in  Section  5.  An  asterisk,  e.g.  1632  ,  indicates  that  1632  is  a 
lower  bound  and  the  actual  number  is  almost  certain  to  exceed  1632. 


Problem 

_1: 

7  Rows 

12 

25 

154 

12 

10 

8 

Problem 

_2: 

15  Rows 

14 

19 

46  425 

14 

19 

26  31 

Problem 

2: 

9  Rows 

8 

9 

14 

41  112 

8 

9 

10 

9  9 

Problem 

_4: 

10  Rows 

10  9  17  3  2 


10  9  15  3  2 
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Problem  5:  15  Rows 


1A 

19 

39 

38 

56 

1A 

19 

2A 

23 

32 

Problem  6:  20  Rows 


20 

23 

21 

20 

26 

53 

A9A 

20 

22 

20 

19 

23 

AA 

176 

and  on  the  next  elimination  the  number  of  inequalities  exceeded  186 — the 
capacity  of  the  machine  for  this  problem. 

Problem  7:  20  Rows 


19 

38 

136 

177 

19 

38 

90 

106 

and  on  the  next  elimination  the  number  of  inequalities  exceeded  502 — the 
capacity  of  the  machine  for  this  problem. 


Problem  8:  20  Rows 


29 

28 

A8 

170 

A 

1632 

5810* 

12391* 

27930 

29 

28 

A2 

97 

195 

22A 

619 

288 
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7.  COMPARISON  WITH  OTHER  METHODS 

There  are  other  methods  for  finding  extreme  vectors  or  points  of  a 
convex  polyhedral  cone  or  set,  the  main  ones  being  Balinski's  method  [1]  and 
Motzkin's  Double  Description  Method  [12]  (see  Balinski  [1]  for  a  discussion 
of  the  literature).  The  remainder  of  this  section  will  be  concerned  with  a 
comparative  analysis  of  these  two  methods  and  will  assume  a  detailed 
knowledge  of  them. 

Motzkin's  method  has  been  virtually  ignored — perhaps  because  he  simply 
stated  a  method  with  some  motivation  but  no  proof.  The  first  part  of  the 
paper  is  devoted  to  a  specialization  of  the  Double  Description  Method  (to 
the  problem  of  finding  all  solutions  of  a  two-person  zero-sum  game)  which  was 
arrived  at  independently  by  laiffa,  Thompson  and  Thrall  who  co-authored  the 
paper.  The  general  Double  Description  Method  is  given  in  the  last  few  pages 
which  were  written  by  Motzkin. 

Although  reference  is  made  to  Notes  of  a  Seminar  on  Linear  Programming 
at  the  Institute  for  Numerical  Analysis,  these  contain  little  additional 
information.  Motzkin  assumes  that  the  reader  shares  his  masterly  under¬ 
standing  of  polyhedral  cones — consequently  some  passages  may  require  careful 
examination  before  their  meaning  becomes  clear.  Once  understood,  however,  the 
paper  provides  profound  insight  not  only  into  the  nature  of  polyhedral  cones 
and  the  Double  Description  Method  but  also  into  the  method  which  I  have 
developed  independently  in  this  thesis.  Motzkin  describes  three  versions  of 
his  method: 

(i)  The  general  problem  Ax  >  0  .  This  method  is  very  similar  to 
mine  in  that  linear  constraints  are  added  one  at  a  time.  The 
main  difference  is  in  the  formation  of  a  new  extreme  vector  of 
the  cone  A^x  >  0  ,  i  =  1,2,  ...,  k  as  a  convex  combination  of 
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two  extreme  vectors  of  the  cone  A^x  >  0  ,  i  ■  1,2,  ....  k-1  . 
Motzkin  checks  first  to  see  whether  the  latter  two  vectors  are 
adjacent  before  generating  the  new  vector.  Adjacency  is  a 
necessary  and  sufficient  condition  that  the  new  vector  be 
extreme.  In  my  method  new  vectors  are  formed  from  all  pairs  of 
the  old  extreme  vectors  which  are  on  opposite  sides  of  the 
hyperplane  A^x  *  0  because  we  have  a  "quick  and  dirty"  means, 
in  the  form  of  the  criterion  of  Theorem  5.3,  of  eliminating 
nonextreme  vectors.  Then  the  criterion  of  Theorem  5.2  is  used 
to  remove  any  remaining  redundant  vectors.  It  is  difficult 
say  which,  of  Motzkin' s  and  my  methods,  is  the  more  efficient. 

Possibly  an  improvement  over  both  would  be  to  order  the  extreme 

vectors  in  some  lexicographical  manner,  dependent  upon  the 
position  of  nonzero  elements,  so  that  we  could  have  a  "quick 
and  dirty"  means  of  determining  adjacency. 

(ii)  The  special  case  Ax  >  0  ,  x  >  0  where  A  is  "nondegenerate." 

Motzkin  states  at  the  top  of  Page  70  of  his  paper  [12]  that  by 

"nondegenerate"  in  this  context  he  means  that  "no  n+1  of  the 
(inhomogeneous)  linear  functions  vanish  at  the  same  point,  which 
is  always  the  case  after  a  small  change  of  the  coefficients." 

By  this  I  am  sure  he  means  that  no  n  (or  more)  of  the  nrt-n 
functions 


xj  *  J  ~  1,2 . n 

A^x  ,  i  ■  1 ,2  ,  . . . ,  m 

may  vanish  at  the  same  point,  i.e.,  each  solution  (x,s)  of  the 


system 
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x  >  0  ,  s  >  0  , 

Ax  +  Is  -  0 

must  have  more  than  ((m-t-n)-n)  ■  m  nonzeros.  Now  this 
condition  of  nondegeneracy  would  mean  that  if  we  were  to  apply 
our  method  to  this  problem  it  would  be  unnecessary  to  use  the 
criterion  of  Theorem  5.2  because  the  criterion  of  Theorem  5.3 
would  eliminate  all  of  the  redundant  vectors;  and  in  fact  this 
version  of  the  Double  Description  Method  is  a  variant  of  our 
method  without  Theorem  5.2' s  criterion. 

(iii)  Ax  >  b  ,  x  >  0  ,  A  nondegenerate  and  A  >  0  .  Although 

Motzkin  doesn't  say  so  the  condition  A  >  0  is  necessary  for 
the  computational  method  which  he  gives  at  the  very  end  of  his 
paper.  Without  it  he  could  not  assume  as  he  does,  that 

(a)  In  Step  s  A  ,  P  k  0  *  0  implies  that  >  0  . 

(b)  Ax  ■  0  has  no  nonzero  solution. 

Incidentally,  it  is  interesting  to  note  that  in  his  1936  thesis  Motzkin 
was  within  a  hair  of  discovering  the  relaticrship  between  Fourier's  method 
and  his  own  Double  Description  Method  but  made  an  error  in  the  logic  of 
Section  86  (assuming  that  (vG^)p^  is  identical  to  v(G^p^))  .  We  can  see 
now  that  the  Double  Description  Method  and  my  method  both  become,  in  essence, 
Fourier's  method  if  new  vectors  generated  by  two  old  notfadjacent  vectors  are 
not  eliminated  at  each  iteration. 

Balinski's  method  is  not  strictly  comparable  with  our  method  since  it 
is  designed  to  find  only  the  vertices  (i.e.,  nonhomogeneous  extreme  vectors) 
of  the  system  Ax  >  b  where  the  columns  of  A  are  linearly  independent.  By 
putting  this  system  in  the  form 


T 
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« 


* 


Ax'  -  Ax"  -  Is  -  bz  -  0 

(7.1) 

x'  ,  x"  ,  8  ,  Z  >  0 

we  see  that  our  method  could  solve  Balinski's  problem  (merely  select  those 
extreme  vectors  In  which  z  >  0)  .  However,  Balinski's  method  does  not  find 
homogeneous  solutions  and  sc  it  does  not  directly  solve  our  problem.  I  feel 
it  can  be  modified  to  do  this  but  has  not  been  to  date.  Ballnski  has 
written  a  computing  code  for  the  special  problem 

Ax  >  b  ,  x  >  0 

and  an  attempt  was  made  to  compare  efficiencies  with  actual  problems  using  a 
computer.  The  tests  were  inconclusive  as  we  will  show  below. 

Test  Problem  1 

When  stated  in  the  form  (7.1)  the  A  matrix  of  this  problem  had  IS 
columns  and  4  rows.  The  CDC  6400  computer  was  used  and  it  required  2. 359 
seconds  to  produce  the  248  extreme  vectors  using  my  code.  82  of  these  were 
vertices.  Balinski's  code  on  the  same  machine  required  5.655  seconds  and 
1503  points  (Iterations)  were  examined.  However,  there  was  some  degeneracy 
which  caused  some  vertices  to  be  generated  many  times  by  Balinski's  code. 
Altogether  his  code  generated  126  solutions  which  it  classed  as  "vertices;" 

44  out  of  the  126  were  duplicates.  So  although  Balinski's  code  required 
more  time  it  was  in  effect  doing  unnecessary  work.  The  problem  of  degeneracy 
is  a  significant  one  for  Balinski's  method  because  most  practical  problems 
are  degenerate  and,  although  this  car  be  eliminated  by  perturbing  the 
coefficients  slightly,  my  experience  has  been  that  this  increases  the  number 
of  extreme  vectors  (and  hence  vertices)  enormously. 
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Test  Problem  2 

A  had  11  columns  and  8  rows.  My  code  required  20  seconds  to  generate 
the  336  extreme  vectors  of  which  90  were  vertices.  Balinski's  code  required 
285  seconds  and  45,370  iterations  to  solve  the  problem.  However,  again 
there  was  much  duplication  of  vertices — but  not  caused  by  degeneracy  this 
time.  Since  theoretically  vertices  cannot  be  repeated  I  conclude  that  there 
is  an  error  in  Balinski's  code. 

In  conclusion,  the  indication — not  definite — is  ’■''at  Balinski's  method 
is  less  efficient  for  generating  all  vertices.  However,  there  are  certain 
problems  in  which  one  is  looking  for  one  vertex  with  certain  characteristics 
and  for  this  Balinski's  method  seems  well  suited  because  it  passes 
progressively  from  one  verte.;  to  another. 
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8.  APPLICATIONS 

(1)  Linear  Programming  With  Varying  Cost  Coefficients 
Consider  the  problem 


(8.1) 


Minimize  cx 

Subject  to  Ax  =»  b  ,  x  >  0 


which  is  to  be  solved  many  times  (say  every  day)  with  fixed  A  and  b  but 
varying  c  .  We  can  find  the  set  of  all  extreme  points  of  Ax  ■  b  ,  x  >  0 
by  finding  all  extreme  vectors  of  the  cone 


x  >  0  ,  X  >  0  , 

B  '  a 

Ax  -  bX  =0  , 


using  our  method,  and  discarding  those  extreme  vectors  for  which  X  *  0  . 

Then  the  linear  program  may  be  solved  by  simply  choosing  the  extreme  vector 
x  which  yields  the  smallest  cx  .  This  method  is  suited  to  a  problem  of  this 
sort  because,  although  considerable  work  is  involved  in  generating  all  extreme 
points,  this  need  only  be  done  once. 

(ii)  Linear  Programming  With  Varying  Right-Hand  Sides 
Consider  the  problem 

Minimize  cx 

Subject  to  Ax  >  b  ,  x  >  0 

but  this  time  we  will  assume  that  A  and  c  remain  fixed  while  b  varies. 
Then  by  eliminating  x  from  the  system 

lx  >  0  , 

Ax  >  b  , 

-cx  +  z  >  0  , 
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we  can  solve  all  subsequent  problems  very  simply.  To  be  specific  we  need  to 
do  the  following: 

(a)  Obtain  all  extreme  vectors  (s^,w*,aS  ,  i  •  1.2,  ...  of  the  cone 

si  +  wA  -  Ac  =  0  , 
s  ,  w  ,  A  >  0 

for  which  A*'  >  0  . 

(b)  For  each  i  compute  and  store  •  w*  ,  i  “  1,2,  ... 

A1 

Then  for  each  b  we  simply  form  the  inequalities 

z  >  “■  ■  w^b  ,  i  *■  1,2,  ... 

A1 

and  choose  the  i  which  gives  the  largest  value  of  •  w*b  .  The 

A  ^ 

A 

corresponding  value  of  x  may  be  formed  using  the  method  of  (iii)c  below. 


Finding  All  Solutions  Of  A  Linear  Program 


This  can  be  achieved  in  at  least  three  ways 

(a)  By  use  of  (i)  above  and  by  selection  of  all  extreme  vectors  x 
which  yield  the  minimum  value  of  cx  . 

(b)  Solve  the  problem  first  by  the  simplex  method  giving,  say, 

Min  cx  =  z  and  then  find  all  extreme  points  of  the  polyhedral 
o 

set 


x  >  0  , 
Ax  =  b  , 
cx  =  z 

o 


(c)  Eliminate  x  from  the  system 


A3 


lx  >0 

(8.2)  Ax  >  b 

-cx  +  z  >  0 

s 

yielding  a  system  of  inequalities  in  the  variable  z  ,  of  the 
form 


(8.3)  z  >  wb 

where  w  is  an  extreme  point  of  the  polyhedral  set 


-c  +  wA  <  0 
w  >  0 

(i.e.,  if  (w°,w,s)  is  an  extreme  vector  of  the  polyhedral  cone 

-w°c  +  wA  +  si  "  0  , 
w°  >  0  ,  w  >  0  ,  s  >  0 

we  select  only  those  extreme  vectors  for  which  w°  >  0  and  insist, 
for  them,  that  w°  be  equal  to  1).  Now  let  zq  be  the  minimum 
value  of  z  .  It  will  satisfy  some  of  the  inequalities  (8.3)  as 
equalities.  Taking  each  of  these  equalities 


z  =  w'  b 
o 

in  turn  we  notice  that  it  is  a  positive  linear  combination  of  some 
of  the  equalities  of  (8.2)  which  must  therefore  also  be  satisfied 
as  equalities  by  the  optimal  value  of  x  and  z^  .  Now  (w°,w,s) 

.s  an  extreme  point  and  therefore  the  subsystem  of  equations  contains 
no  redundant  row,  is  nonsingular  and  may  be  solved  by  inverting  its 


matrix  of  coefficients. 
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Incidentally,  logic  similar  to  the  above  may  be  used  to  derive  the 

v 

Duality  Theorem  of  Linear  Programming. 

t 

(iv)  Finding  All  Solutions  Of  Bimatrix  Games 

0.  L.  Mangasarian  [10]  has  given  a  method  for  finding  the  set  of  all 
equilibrium  points  of  bimatrix  game.  His  method  uses  Baiinski's  algorithm 
for  finding  all  vertices  of  a  convex  polyhedral  set.  Our  method  may  be 
substituted  for  Baiinski's  in  this  context. 


(v)  Bilinear  Programming 

As  Mangasarian  and  Stone  showed  [9] ,  finding  an  equilibrium  point  of  a 
bimatrix  game  is  equivalent  to  finding  a  solution  of  a  certain  type  of  bi¬ 
linear  program.  The  general  bilinear  programming  problem  may  be  stated  as 


(8.5) 


Minimize  xCy  +  px  +  qy 
Subject  to  Ax  =  b 
By  =  d 

x  >  0  ,  y  >  0 


and  it  may  be  solved  by  two  methods  at  least: 

(a)  As  is  well  known,  there  is  a  solution 
x°  is  an  extreme  point  of 


(x°,y°)  of  (8.5)  such  that 


V 


(8.6)  Ax  =  b  ,  x  >  0 
and  y°  is  an  extreme  point  of 

(8.7)  By  »  d  ,  y  >  0  . 

And  so  we  can  take  all  possible  pairs  of  extreme  points,  one  of 
(8.6)  and  the  other  of  (8.7),  and  choose  the  pair(s)  which  give  the 
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minimum  value  of  xCy  +  px  +  qy  .  This  is  basically  what 
Mangasarian  did  (see  (iii)  above). 

(b)  (K  Murty  is  responsible  for  the  following  idea:)  In  (8.5)  let  us 
assume  for  the  moment  that  we  have  fixed  x  and  we  want  to  choose 
the  optimal  y  .  (8.5)  then  becomes  a  linear  program  whose  dual  is: 

Maximize  z 
u 

Subject  to  ud  -  z  »  0 

(8.8)  uB  -  xC  <  0 

Ax  =  b 

x  >  0  . 

3 

Now  for  fixed  x  ,  the  system  (8.8)  defines  a  convex  set  in  u  and 

z  which  means  that  Max  z  is  a  concave  function  of  x  .  The 

u 

problem  (8.5)  therefore  becomes  Minimize  {Max  ud  |  uB  -  xC  <  0  , 

x  u 

Ax  =  b  ,  x  >  0}  that  is  to  say,  a  problem  of  minimizing  a  concave 

function,  (Max  ud  |  uB  -  xC  <  0)  ,  over  a  convex  set  (Ax  =  b  , 

u  “ 

x  >  0)  .  One  approach  to  this  problem  is  to  first  eliminate  u 
from  the  system  (8.8)  giving  a  system  in  z  and  x  ;  then  find  the 
extreme  points  of  Ax  =  b  ,  x  >  0  substituting  each  time  to  find 
the  minimum  possible  value  of  z  . 

(vi)  Minimizing  A  Concave  Function  Over  A  Convex  Polyhedral  Set 

Mangasarian  in  his  forthcoming  book  on  Nonlinear  Programming  gives  the 
following  theorem  (Section  5.1,  Theorem  8): 

Let  R  be  convex  and  let  g(x)  be  concave  on  R  .  If  g(x)  is  not 
constant  on  R  then  no  interior  point  of  R  can  solve  the  problem 

Minimize  g(x)  . 
xeR 
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This  implies  that  If  R  Is  defined  by  linear-  constraints  and  If  8<x)  Is 
not  constant  then  the  minimal  value  is  achieved  at  an  extreme  point.  Therefore 
our  method  can  again  be  used  to  generate  all  extreme  points  and  thereby  solve 

this  problem. 
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